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■^r^Sign.a)  detection  in  a  large  segment  of  non- 
Gaussian  and  non-stationary  .Arctic  under-ice  noise, 
which  contains  both  high  power  narrow-band  and 
impulsive  components,  is  examined.  It  is  show  n  that 
the  correlation  functions  of  sui>-5egn)ents  of  data 
change  significantly,  and  if  ignored,  can  degrade  tli' 
performance  of  a  detector.  For  a  false  alarm  proba- 
bility  of  0.05  and  a  known  constant  signal,  the 
matched  filter  wa.s  on  average  17. better  tlian  a 
detector  designed  a-ssuming  independent  noise  sam¬ 
ples.  It  is  alsc'  shown  that  pre-processing  the  data 
with  an  adaptive  notch  filler,  then  using  the  matched 
flli'-r  will  result  in  a  further  improvement  of  about 
C^.  Additionally,  the  effect  two  different  signal 
sliapes  have  on  the  performance  of  tlie  matched  filter 
is  examined. 
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With  the  goal  of  creatitig  a  databa.se  of  under-ice 
noise,  in  1080  a  mitl'i-insiiiutlon.al  rescarcli  program  colla¬ 
borated  to  record  .Arctic  under-ice  data.  The  data.  PRAM 
If.  was  recorded  on  23-2-1  .April  1980.  from  a  pack  ice  floe, 
at  8G'.V  25'' IP.  .An  omnidirectional  hydrophone,  radio 
linked  to  a  receiver,  was  suspended  to  a  depth  of  91m  in 
4000m  deep  water.  The  data,  recorded  on  an  analog  device, 
was  then  bandpass  filtered  from  0.01-5kHz.  The  particular 
data  segment  being  analyzed  wa®  recorded  on  the  23  .April 
from  11:30  to  11:-10  pm.  Subsequent  to  the  bandpass  filter 
previously  mentioned,  the  data  was  passed  through  a  low 
pass  filter  w  ith  cutoff  of  2.5  kHz.  and  digitized  with  a  sam¬ 
pling  frequency  of  10  kHz.  .A  total  of  G.150.1-1-1  data  sam¬ 
ples  were  taken,  which  conveniently  breaks  down  to  6000 
records  with  each  record  containing  exacilv  102-1  samples 
11.2]. 

Several  authors  hare  looked  at  this  particular  data 
secnient  in  varving  d'-grec-  of  detail.  Dwyer  [l‘  took  <-ev(  r:i! 
different  approaches  tci  analyzing  the  under-ice  dai.a.  He 
examined  the  first  four  central  moments  in  the  time 
domain  and  concluded  that  the  noise  is  non-stationary  and 
non-Gaussian.  A  similar  analysis  carried  out  in  tlie  fre¬ 
quency  domain  yielded  comparal'lc  results.  The  energy 
spectrum  indicated  that  the  ice  noise  was  heavier  tailed 
tlian  a  Gaussian  source  processed  in  tlte  same  fasliion.  in 
agreement  with  the  results  of  Greene  and  Buck  |3j.  From 
an  examination  of  tlie  spectrum  and  tli'-  tlim-  domain  plots 
of  tlie  data,  bcjth  narrow-band  and  impulsive  romi>oi)eiit' 


were  identified.  The  existence  of  these  comptonenls  was 
not  surprising,  since  the  floe  was  moving  slowly  throughout 
the  experiment.  .As  suggested  by  the  work  of  Milne  and 
Ganton  [qj.  the  movement  of  the  floe  would  cause  rifting 
and  cracking  of  the  ice. 

Witch  and  Wilks  [2]  texak  a  slightly  different  approach 
since  their  objective  was  to  develop  a  statistical  model  for 
the  under-ice  noise.  Their  work  confirms  the  presence  of 
narrow-band  and  impulsive  components,  but  suggests  that 
tlie  background  noise  is  Gaussian.  By  examining  the  first 
four  central  moments  of  the  data  in  detail,  they  noted  a 
strong  correlation  between  records  with  large  kurtosis  and 
large  skew  .  They  also  determined  that  a  large  kurtosis  was 
associated  with  high  amplitude  bursts.  In  addition,  it  was 
proposed  that  the  .Arctic  data  could  be  modeled  in  the  fre¬ 
quency  domain  as  a  mixture  of  a  Gaussian  background  pro¬ 
cess.  a  sum  of  a  random  number  of  narrow-band  com¬ 
ponents.  each  with  random  piia.se  and  amplitude,  and  what 
is.  in  some  sense,  an  error  term.  The  main  drawbacks  of 
the  model  are  tliat  the  impulsive  component,  which  has 
been  verified  as  being  present,  is  not  modeled  specifically. 
.Also  the  w  eights  and  number  of  sinusoidal  components  are 
not  modeled  constructively;  that  is.  for  some  arbitrary  seg¬ 
ment  of  the  data,  the  number  of  tones  is  unknown. 

Recently.  Nielsen  and  Thomas  (5]  proposed  that  the 
univariate  statistics  of  the  Arctic  data  could  be  modeled  by 
the  generalized  Gaussian  family,  with  the  parameters  of  the 
model  estimated  from  the  data.  The  model  was  shown  to 
be  a  reasonablx-  accurate  representation  of  the  empirical 
probability  mass  functions  for  blocks  (5  records)  of  data, 
but  was  not  accurate  enough  for  the  resulting  Neyman- 
Pearson  optimal  detector  to  show  an  improvement  over  a 
simple  linear  structure. 


Let  a  sample  function  of  a  continuous  parameter  ran¬ 
dom  process  be  sampled  at  the  Nyquist  frequency,  yielding 
samples  j,.  Define  tlie  time  correlation  functions  as 
1 

■'  1-1 

Equivalent  1>'.  for  any  realization  j_.  the  Fourier  transform  of 
the  realization  can  be  used  to  compute  the  power  spectrum, 
which  can  then  be  inverse  Fourier  transformed  to  obtain  an 
estimate  of  the  time  correlation  function.  The  Arctic  data 
was  sampled  at  10  kHz,  which  is  twice  the  Nyquist  fre¬ 
quency.  For  these  coni  put  at  ion^  the  data  was  decimated  by 
a  factor  of  two  so  that  the  sampling  frequency  was  equal  to 
the  Nyquist  frequency;  thus,  the  correlation  functions  were 
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computed  using  A' =>12. 

Since  time  correlation  functions  for  real  processe.s  are 
even  about  the  origin.  <'=0,  the  graphs  of  typical  correla- 
tioi.  functioits.  shown  in  Figures  I  -  d.  are  plotted  only  for 
1.  2 .  256.  In  general,  it  wa.s  found  that  the  correla¬ 

tion  function  took  on  one  of  four  different  shapes.  The 
time  correlation  function  of  record  0001.  Figure  1,  shows  a 
mixture  of  a  decaying  low  frequency  component  with  a 
small  amplitude  high  frequency  comr«3nent.  The  main 
variatioas  for  this  type  of  function  are  the  rate  of  decay 
and  the  amplitude  of  the  high  frequency  component. 

"i  I 

P(K)  I 


-0.5; 


-I.Oi _ I _ i _ i _ : _ j) 

0  50  100  150  200  250 


k 

Figure  1  -  Time  Correlation  Record  0901 


Figttre  2  shows  the  correlation  functiot)  for  record 
0905.  approximately  one-half  second  later  than  record  0901. 
The  correlation  drops  rapidl.v  a.®  k-  increases,  and  th^n  has 
small  amplitude  variatioas  about  zero.  .A  close  examination 
of  this  correlation  function  shows  that  significant  correla¬ 
tions  exist  for  the  first  three  shifts.  The  drastic  change  in 
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I'  igure  2  -  Time  Correlation  Record  0905 


the  shape  of  the  correlation  function  in  a  short  period  of 
time,  as  illustrated  by  Figures  1  and  2.  is  not  unusual  for 
this  data. 

The  third  typical  correlation  function  is  shown  in  Fig¬ 
ure  3.  For  record  1205  the  correlation  is  a  mixture  of  a  low 
frequency  and  a  high  frequency  component.  The  result  is 
that  the  correlation  function  loolo^  like  a  corrupted  AM  sig¬ 
nal.  The  principal  variations  with  the  general  shape  are  in 
the  magnitude  of  the  correlation  for  small  k,  and  the  decay 
rate  of  the  high  frequency  component. 
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Figure  3  -  Time  Correlation  Record  1205 

One  of  the  most  interesting  correlation  functions 
belongs  to  the  final  category.  In  Figure  4  the  correlation 
function  for  data  record  3905  is  shown.  The  correlation  for 
this  data  is  very  high,  and  periodic,  with  peak  correlations 
greater  than  0.5  for  shifts  greater  than  zero.  The  peak 
correlation  for  shifts  other  than  zero  were  found  to  be  plus 
or  minus  one  for  some  data  records,  and  the  envelope  shape 
varied  greatly.  The  main  factor  correlation  functions  of 
this  type  have  in  common  is  the  presence  of  a  single,  high 
powered  periodic  component. 

Adaptive  Nntcb  Filter 

The  correlation  between  data  samples,  which  has  been 
shown  to  be  significant  in  the  Arctic  under-ice  noise,  pro¬ 
vides  a  complicating  factor  in  the  problem  of  signal  detec¬ 
tion.  This  results  primarily  from  inability  to  model  the 
multivariate  density  if  the  data  is  known  to  be  non- 
Gaussian.  A  common  approach  to  working  with  highly 
correlated  noise  is  to  pre-whiten  the  data,  ideally  resulting 
in  independent  noise  samples.  Sub-optimal  pre-whiteners, 
which  are  designed  to  reduce  instead  of  eliminate  the  corre¬ 
lation  between  data  samples,  are  used  with  real  noise 
sources. 

For  environments  such  as  the  Arctic  data,  which  con¬ 
tain  a  combination  of  narrow-band  and  wideband  processes, 
an  adaptive  line  enhancer  [6]  is  commonly  used  to  separate 
the  two  processes.  Suppose  the  received  vector  1'  is  the 
sum  of  two  components;  a  white  background  component  A’ 
and  a  sinusoidal  component  5.  We  can  represent  this 
mathematically  as 
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Figure  4  -  Time  Correlation  Record  3905 


)•  =  ,V  +  >• 

In  Figure  o  we  design  the  filter  to  be  an  adaptive  notch 
filler  (ANT)  with  a  gain  of  unity  everj'where  except  at  a 
specified  band  of  frequencies,  where  the  gain  zero.  The 


Figure  5  -  Adaptive  Line  Enhancer 


filter  output  will  be  the  white  noise  .V  and  the  output  of 
the  difference  operator  will  be  the  sinusoid  5.  We  define 
the  ANT  as  follows  [7) 


H(z) 


f-  -f  6.-  +  1 

r  +  -lA.'  +  r 


Then.  we  have 


bandwidth  ~  2(1  — ‘j).  ")e(0,l) 

normalized  center  frequency  s:  cos~'(— 6/2).  6  e(— 2.2] 

For  an  input  sequence  y*  and  output  sequence  of  the  ANT 
u*.  we  have 

u'i  -  y*  +  6y*_|  +  -  6'ya>_,  - 

For  a  fixed  notch  width,  the  only  adaptive  parameter  of  the 
filter  is  6.  which  controls  the  center  frequency  of  the  notch. 
To  deri\e  the  update  equation  for  6.  we  minimize  using 
a  gradient  search  algorithm,  then 

>>k+\  “  +  step  tc*  (yt_,  -  q  ti>_i) 

The  variable  step,  which  can  be  either  fixed  or  adaptively 
set,  controls  the  adaptation  speed  of  the  filter  and  the  vari¬ 
ance  of  the  filter  center  frequency  once  steady  state  ha.« 
been  achieved.  That  is.  a  value  of  step  too  small  will  result 
in  a  long  time  for  the  filter  to  adjust  to  the  proper  center 
frequency,  however,  once  there,  only  small  changes  in  the 


filter  center  frequency  will  occur.  The  converse  results  will 
hold  for  a  value  of  step  too  large.  One  advantage  of  the 
ant  is  that  it  can  be  implemented  easily  as  a  time  domain 
processer,  in  contrast  with  the  frequency  domain  pre¬ 
processor  proposed  by  Dwyer  [8], 

As  an  example  of  the  effect  the  ANT  can  have  on  the 
correlation  function.  Figures  6-7  show  the  correlation 
function  for  record  4201  before  and  after  filtering.  The 
ANT  essentially  removes  the  high  frequency  component 
which  dominates  the  correlation  function,  resulting  in  data 
which  is  much  less  correlated.  The  results  are  not  always 
as  dramatic  as  with  record  4201.  Specifically,  if  the  data 
contains  no  dominant  narrow-band  component,  little  differ¬ 
ence  in  the  correlation  function  after  filtering  is  observed. 
Also,  when  the  data  contains  multiple  narrow-band  com¬ 
ponents.  the  ANT  only  removes  the  highest  powered;  thus 
the  filtered  data  still  has  a  correlation  function  with 
periodic  components. 


Figure  6  -  Time  Correlation  Record  4201 


Figure  7  -  Time  Correlation  Filtered  Record  4201 
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Consider  tlie  I'ollowing  simple  dcteciion  problem.  Let 
M  lie  tlie  number  of  sanii'le>  taken  per  decision,  and  let  \ 
he  tlie  M-vector  of  samples  from  the  zero  mean  noise  pro¬ 
cess  and  s  be  the  known  signal  M-vector.  Let  /y  represent 
the  M-variate  density  of  the  noise.  Define  a  binary 
hypothesis  problem  in  the  following  manner 


Ho:  X  =  \ 

//i:  i=A-fs 

w  here  X  is  the  vector  received  by  the  dctectoi  and  is  of 


length  M.  Let 

a  =  Probability  (  decide  Hi  |  Hq  Irve) 

.?  =  Probability  (  decide  Hi  |  //j  true) 

The  Neyman-Pearson  criterion  can  be  written  as  follows. 
Fix  some  Oo-  Define  the  optimal  detector  to  be  the  detec¬ 
tor  which  ma.\imi2es  the  power  3  for  an\  a  <  Oo-  Define 


the  likelihood  ratio  I  as  follows 


l(X)  =  log 


fsiiX-^) 


hf(X) 


Then  the  optimal  detector  can  be  written  a.s 
lU)  >  T  -*  Hi 
<  T  Hq 

The  threshold  T  is  chosen  to  satisfy  tlie  constraints  on  the 


power  iand  on  the  level  q  [9j. 


.\.s  mentioned  before,  for  non-Gaussian  data  which  is 
correlated,  estimating  the  multivariate  density  of  the  noise 
may  not  be  possible.  One  approach  taken  to  circumvent 
tliis  problem  is  to  a-ssume  that  the  multivariate  statistics 
are  Gaussian;  then  asiiig  tlie  Neyni.  .. -Pearson  criteria,  the 
optimal  likelihood  ratio  is  the  matched  filler.  If  R  is  the 
M  X  M  covariance  matrix  of  the  noise,  then  the  matched 
filter  is  defined  as 

l{x)  =  JR-h 


For  independent  noise  samples,  flA’i.V,  =  6,,<r  and  the 
covariance  matrix  R  <=cr  }.  where  I  is  an  M  x  M  identity 
matrix.  Consequently,  the  likelihood  ratio  reduces  to 
^(i)  The  resulting  detector  is  then  a  correlator. 

In  addition  only  the  variance  of  the  noise  cr  need  be 
estimated. 

For  correlated  data  the  estimation  of  the  elements  of 
the  covariance  matrix  is  necessary.  WTiile  the  Arctic  noise 
does  not  appear  to  be  stationary  in  any  sense,  we  will 
assume  that  within  a  record  (that  is  1024  contiguous  sam¬ 
ples)  the  data  is  at  least  wide  sense  stationary  and  ergodic. 
and  that  the  mean  and  variance  of  the  noise  are  known 
exactly.  L'nder  the  assumption  of  ergodicity.  the  correla¬ 
tion  function  is  equivalent  to  a  vector  of  expected  values. 
Pk  ’=  EXjXj^.k.  Define  the  covariance  matrix  of  the  noise 
as 


R, 


-  P,. 

=1024  samples:  thus  P*  is  com- 


/?,;  =  EX.X,  = 

Each  record  contains  A'« 
puted  for  1,  1023.  For  a  signal  length  of  M.  only 

the  first  M  correlations  are  used  in  the  formation  of  R. 
The  resulting  covariance  matrix  R  is  symimetrlc  and  Toe- 
plitz.  For  f?  to  be  a  valid  invertible  covariance  matrix,  we 


require  that  it  be  positive  definite. 

For  convenience,  the  data  was  normalized  such  that 
Pq  =  1.  Thus,  the  covariance  matrix  is  a^R.  where  cr  is 
the  variance  of  the  record  and  R  is  the  normalized  covari¬ 
ance  matrix.  Henceforth,  we  refer  to  the  normalized  covari¬ 
ance  matrix  as  the  covariance  matrix.  A  signal  length  of 
Af  =  4  was  used  for  all  simulations.  For  the  entire  data 
set.  6006  covariance  matrices  were  computed.  The 
minimum  eigenvalue  was  always  strictly  greater  than  zero, 
insuring  that  R  was  indeed  positive  definite,  and  typical 
values  for  the  four  eigenvalues  were  0.0077,  0.163,  1.216. 
and  2.581.  In  addition,  a  comparison  of  covariance 
matrices  showed  that  the  elements  changed  slowly  over 
time.  This  suggests  the  assumptions  of  wide  sense  stationar- 
ity  and  ergodicity  are  not  unreasonable  for  such  a  short 
segment  of  data  (I  record  ~0.]  seconds). 

To  determine  the  effect  that  correlation  between  data 
samples  has  on  the  signal  detection  problem,  the  power  0 
for  the  following  detectors  was  compared  when  A/=4. 

Dl  Correlator  (/?  =  I) 

D2  Matched  Filter 

D3  ANT  -  Matched  Filter 

For  jD]]-[D3].  in  order  to  compensate  for  the  non- 
stationary  characteristic  of  the  noise,  parameters  were 
estimated  on  a  record-by-record  basis.  For  (Dlj,  was 
estimated;  for  [D2]-[D3].  the  covariance  matrix  R  w’as 

estimated,  using  the  method  described  previously.  In  par¬ 
ticular.  for  [D3].  the  ,\NT  was  used  as  a  preprocessor,  and 
the  covariance  matrix  was  estimated  using  the  filtered 
data.  For  convenience  the  data  was  grouped  into  blocks  of 
20  records.  Thus  each  block  contained  20480  data  samples, 
and  represented  a  time  interval  of  about  2  seconds.  We 
had  a  total  of  300  blocks,  and  the  performance  was 
evaluated  for  each  block. 

In  Table  1.  the  mean  and  variance  of  the  ratio  of  the 
power  levels  for  (Dl)  to  [D2]  are  shown,  along  with  the 
minimum  and  maximum  values  of  the  ratio,  and  the  perfor¬ 
mance  improvement.  We  defined  the  performance  improve¬ 
ment  to  be 

I(Dj,  D.>  X  100 

The  results  for  detectors  [D2]  and  [D3]  are  presented  in 
Table  2.  Examining  Tables  1  and  2,  we  se  that  the  largest 
improvements  in  performance  are  for  a«0.05;  and  while 
the  performance  improvement  decreases  as  o  increases,  the 
improvement  is  still  significant  for  a  =  0.20.  For  a  —  0.05. 
[D2)  is  approximately  17.6%  better  and  [D3]  is  approxi¬ 
mately  29.2%  better,  on  average,  than  pi).  For  Q  —  0.20. 
the  improvements  are  smaller,  but  at  12.4%  and  19.3%, 
they  are  still  significant. 

Given  that  the  Arctic  data  is  highly  correlated  at 
times,  the  improvement  that  the  matched  filter  has  over 
the  simple  correlator  is  understandable.  The  additional 
improvement  which  occurs  w  hen  the  ANT  is  used  as  a  pre¬ 
processor  is  not  as  clear.  It  is  well  known  that  the  output 
of  a  linear  recursive  ri!t''r  is  more  Gaussian  than  the  input. 
Thus  using  the  ANT  as  a  pre-processor  not  only  whitens 
the  data,  by  removing  the  highest  powered  periodic  com¬ 
ponent.  but  also  results  in  data  which  is  more  Gaussian.  It 
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Table  1 

Ratio  of  Power  Levels  between  [D2j  and  [D3] 


Table  2 


is  clear  from  these  results,  that  disregarding  the  correlation 
between  data  samples  will  result  in  a  performance  degrada¬ 
tion. 

The  minimum  and  maximum  values  of  the  ratios  are 
also  interesting.  While  the  maximum  values  of  the  ratio 
are  near  unity,  indicating  that  the  detectors  have  similar 
performance,  the  minimum  values  range  from  0..58  to  0.7.5. 
indicating  that  under  some  conditions  one  detector  is  much 
better  th.an  the  other.  In  particular,  blocks  80-90  ,  d 
170-210  can  be  shown  by  an  analysis  of  their  correlation 
function-s  to  contain  periodic  components.  For  these  cases. 
jD2;  is  about  25‘^f  better  and  [D3]  is  approximately  ■47''7 
better  than  [Dl]  for  q  =  0.20.  Figures  8-9  are  plots  of 
^D\  sintl  for  o  =0.20  to  illustrate  more 

clearly  the  improvements. 

One  may  challenge  the  validity  of  using  the  same  data 
to  estimate  the  covariance  matrix  and  compute  the  detec¬ 
tion  probabilities.  Tests  were  run  where  the  covariance 
matrix  was  updated  every  record,  every  fifth  record,  every 
tenth  record,  and  every  twentieth  record.  It  was  found 
that  the  performance  levels  did  not  change  appreciably  for 
updates  occurring  as  slowly  as  every  ten  records.  If  the 
update  was  performed  every  twenty  records,  once  a  block, 
only  a  small  performance  loss  occurred. 

For  all  computations,  a  constant  signal  was  used  such 
that  the  sum  of  the  elements  of  the  signal  vector  equaled 
the  noise  standard  deviation,  i.e.  input  S/N=l.  For  the 
matched  filter  [D2j.  the  output  S/.N  can  be  written 
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Figure  8  -  Ratio  of  Power  Levels  between  |Dl]  and  [D2| 
for  a  =  0.20 
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For  any  positive  definite  matrix  A  such  that 

are  the  minimum  and  maximum  eigenvalues  of  .4.  b> 

Rayleigh's  principle  we  have  (10[ 
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Figuie  9  -  Ratio  of  Power  Levels  between  [02]  and  [D3j 
for  Q  =  0.20 


